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ABSTRACT 

We present a formalism for analyzing interferometric observations of the Cosmic Microwave 
Background (CMB) anisotropy and polarization. The formalism is based upon the £-space 
expansion of the angular power spectrum favored in recent years. Explicit discussions of 
maximum likelihood analysis, power spectrum reconstruction, parameter estimation, imaging 
and polarization are given. As an example, several calculations for the Degree Angular Scale 
Interferometer (DASI) and Cosmic Background Interferometer (CBI) experiments are presented. 



Subject headings: cosmology: theory - cosmic microwave background 



1. Introduction 

The Cosmic Microwave Background (CMB) has become one of the premier tools for understanding 
early universe astrophysics, classical cosmology and the formation of large-scale structure. It has a wealth of 
information about the origin and evolution of the Universe encrypted in its signal. The frequency spectrum 
is that of a blackbody at 2.73K, confirming the prediction of the standard hot big bang model (see Fixsen 
et al. 1996, Nordberg & Smoot 199S or the review of Smoot & Scott 1997). The small angular scale 



power spectrum of the temperature and polarization anisotropics contain information on the cosmological 



parameters and the structure that existed at decoupling (see e.g. Bennett, Turner & White 1997). 



The predictions for a wide range of cosmological models are now well understood and theoretically 
secure (Hu et al. 1997; hereafter HSWZp . Experimentally, a flurry of results have been reported in the last 
five years. The study of the microwave sky has moved beyond the "detection phase" and into the "imaging 
phase" , with the next generation of experiments planning to provide detailed information about the shape 
of the angular power spectrum of the temperature anisotropics over a wide range of angular scales. 



The advent of low-noise, broadband, millimeter- wave amplifiers (Popieszalski 1993) has made 
interferometry a particularly attractive technique for detecting and imaging low contrast emission, such 
as anisotropy in the CMB. An interferometer directly measures the Fourier transform of the intensity 
distribution on the sky. By inverting the interferometer output, images of the sky are obtained which 
include angular scales determined by the size and spacing of the individual array elements. In this paper 
we discuss a formalism for interpreting CMB anisotropics as measured by interferometers and examine 
what two upcoming experiments, the Degree Angular Scale Interferometer (DASI) and Cosmic Background 
Interferometer (CBI), may teach us about cosmology. 

Several previous papers have dealt with the analysis of CMB data from interferometers (Martin & 
Partridge 1988; Subrahmanyan et al. 1993; Hobson, Lasenby & Jones 1995; Hobson & Magueijo 1996| ). In 
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this paper we extend the work to make exphcit contact with the multipole space (^-space) methods now 
commonly adopted in analyzing single-dish switching experiments, including power spectrum estimation, 
parameter extraction, imaging, polarization and mosaicing. We also give details of the upcoming DASI 
experiment. 

The outline of the paper is as follows. We begin with a discussion of instruments, past present and 
future, in Foregrounds and point sources are discussed in §|[ The theoretical formalism for analyzing 
temperature anisotropics is outlined in §^ while applications including maximum likelihood estimation of 
parameters and power spectrum reconstruction are treated in Increasing the sky coverage, and hence the 
resolution of the instrument in ^-space is introduced in Making images of the microwave sky is addressed 
in m Polarization is treated in fM Finally, §S contains our summary and discussion. 



Instruments 



The use of interferometers to study fluctuations in the CMB goes back over a decade (see Table |^). 
Early work using the VLA and ATCA concentrated on small angular scales, reporting a series of upper 



limits. Recently the CAT (O'SuUivan et al. 199?;) has reported a detection of anisotropy on sub-degree 
scales, at low frequencies (15 GHz). Several groups are now planning to build interferometers which operate 
at higher frequencies and over a larger range of angular scales, with sensitivities which should enable them 
to map in detail the CMB anisotropy spectrum from £ ^ 10^ to ^ ^ 10^. 

An interferometric system offers several desirable features: 1) It directly measures the power spectrum 
of the sky, in contrast to the differential or total power measurements. Images of the sky can then be 
created by aperture synthesis. 2) Interferometers are intrinsically stable since only correlated signals are 
detected; difficult systematic problems that are inherent in total power and differential measurements are 
absent in a well designed interferometer. This considerably reduces signals due to ground pickup and near 
field atmospheric emission. 3) They can be designed for continuous coverage of the CMB power spectrum 
with angular spectral resolution determined by the number of fields imaged. 

Motivated by these attributes, several groups are developing new interferometers targeted at measuring 
the anisotropy in the CMB on sub-degree angular scales. Three instruments are currently under construction: 
the Very Small Array (VSA) in Cambridge, the Degree Angular Scale Interferometer (DASI) and the Cosmic 
microwave Background Interferometer (CBI). DASI and CBI are parallel projects based at Chicago and 
Caltech. In this paper we will concentrate on the DASI instrument, though the formalism is completely 
general and applies equally to any interferometer. 

DASI is an interferometer designed to measure anisotropies in the CMB over a large range of scales with 
high sensitivity^. The array consists of 13 closely packed elements, each of 20 cm diameter, in a configuration 
which fills roughly half of the aperture area with a 3-fold symmetry (see Fig. |l|). Each element of the array is 
a wide-angle corrugated horn with a coUimating lens. DASI uses cooled HEMT amplifiers running between 
26-36 GHz with a noise temperature of < 15K. The signal is filtered into ten 1 GHz channels. DASI will 
operate at the South Pole. 



^More information on DASI can be found at 



tittp: //astro .uchicago . edu/dasi . 
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Name 


Location 


A^dish 


Freq. (GHz) 


Bandwidth (GHz) 


Primary beam 


e 


OVRO^ 


United States 


6 


30 


2.0 


4' 


6750 


VLA^ 


United States 


27 


8 


0.2 


5' 


6000 


Ryle'^ 


England 


8 


15 


0.4 


6' 


4500 


BIMA'i 


United States 


10 


30 


0.8 


6' 


4300 


ATCA'' 


Australia 


6 


9 


0.1 


8' 


3400 


T-W* 


United States 


2 


43 




2° 


20-100 


CATS 


England 


3 


13-17 


0.5 


2° 


339-722 


VSAi^ 


Canary Islands 


15 


26-36 


2.0 


4° 


130-1800 


DASr 


South Pole 


13 


26-36 


10.0 


3° 


125-700 


CBP 


Chile 


13 


26-36 


10.0 


44' 


630-3500 



Table 1: Current experiments to measure CMB temperature anisotropies with interferometers. There are 
published upper limits from VLA, Ryle and ATCA. The CAT has published a detection while VSA, DASI and 
CBI are expected to begin operations around 1999-2000. The location, number of dishes/horns, frequency 
and (approximate) coverage in £ space are listed. 



"Carlstrom, Joy & Grego 1996, ApJ, 461, L59 

''Fomalont EB., et al., 1984, ApJ, 277, L23; Knokc JE., et al., 1984, ApJ, 284, 479; Martin HM. & Patridge RB., 1988, ApJ, 
324, 794; Fomalont EB., et al., 1988, Astron J., 96, 1887; Hogan CJ. & Partridge RB., 1989, ApJ, 341, L29; Fomalont EB., et 
al., 1993, ApJ, 404, 8.; Partridge RB, et al., 1997, ApJ, 483, 38 
Jones, ME., 1997, PPEUC proceedings, Cambridge, April 7-11 

'^Cooray, AR. et al. 1997, AAS, 191 1906 

'^Subrahmanyan R., Ekers R D., Sinclair M., Sil k J., 1993, MNRAS, 263, 416; Subrahmanyan R., Kesteven MJ., Ekers RD., Silk 

J., 1998, MNRAS, in press | |astro-ph/9805245 | 
■fTimbie P.T., Wilkinson D.T., 1990, ApJ, 353, 140 
sO'SuUivan C, et al., 1995, MNRAS, 274, 861 

''see Jones, ME., 1996, in Proceedings of the XVIth Moriond Meeting, ed. F. Bouchet et al., p. 161 

'see Halverson et al., 1998, ASP Conf. Proc, "Astrophysics from Antarctica", ed. G.Novak & R.Landsberg, in press 

^ http; / /astro.caltech.edu/~tjp/CBI/ 



3. Foregrounds 



In order to estimate the contribution of primordial fluctuations to the observed signal, it is necessary 
to estimate the contribution from foreground contaminants. Some reviews of the situation with regards 
astrophysical foregrounds can be found in Brandt et al. ( |1994| ), Toffolatti et al. ( 1994 , 1997) and Tegmark 
& Efstathiou (1996). We refer the reader to these papers for more details, and lists of references. Below we 
summarize some of the more important points for the DASI experiment. 
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3.1. Atmosphere 



The far fieldg of a very compact array is actually quite near the instrument; for DASI this distance 
is only a few hundred meters. Thus everything beyond this distance, including the atmosphere, will be 
imaged by the interferometer. Taking a standard model for the static brightness of the atmosphere with a 
temperature T and zenith opacity n, the brightness varies with zenith distance 9 as 



T 1 



exp 



cos(6l) 



(1) 



Its effect thus appears as a constant term plus a slope and a very slight curvature. Since the interferometer 
rejects low spatial frequencies, this atmospheric contribution is negligible in the final image. 

The dynamic atmosphere causes a fluctuating brightness we must look through, and if it is in the far 



field it will be correlated and appear as excess noise (Church 1995). The Python V experiment observed 
from the South Pole during the austral summer of 1997 at a frequency of 45 GHz and covered angular scales 
comparable to those of DASI. From the level of atmospheric noise, we estimate that the atmosphere will 
contribute only about 10% to the total system noise (Lay et al., 1998, in preparation). 



3.2. Galactic Foregrounds 

3.2.1. Synchrotron 



Due to its low operating frequency, the main foreground which DASI will need to contend with is Galactic 
synchrotron radiation, produced by electrons spiraling in the galactic magnetic field. The specific intensity^ 
of synchrotron emission roughly follows a power-law in frequency /^■^"'^ cx with spatially varying index and 
amplitude (Lawson et al. 1987, Banday & Wolfendale 1991, Platania et al. 1998). The index, /3sync, varies 
from -0.1 to -1.3 with a mean of -0.8. There is some evidence that the index steepens at higher frequencies. 
The angular power spectrum ~ for I < 10^ (see Eq. (^ for a definition of Ci). 



3.2.2. Free-Free 

Free-free emission, also known as bremsstrahlung, is due to scattering of unbound particles, typically 
electrons off nuclei e.g. ep — > ep^. The spectral index (3s depends on the temperature and density of the 
charged particles, but is in the range -0.13 to -0.16 for typical electron density and temperature values for 
the ISM (Bennett et al. 1992| ). At high galactic latitudes, free- free emission is expected to dominate over 



synchrotron emission at around 40 GHz (Bennett et al. 1992 ). No direct maps of free- free emission exist, 
though there is a possible correlation between free- free emission and Ha emission (Bennett et al. 1992). 
Since the Ha maps contain striping the significance of the correlation is not easy to assess. If there is a 
strong correlation the free-free spectrum can be predicted from Ha measurements at galactic latitude 20 



(Reynolds 1992| ) plus fundamental physics to determine /Jg. There is evidence however (Kogut et al. 199(^ , 

^The farficld distance is 2B^/A where B is the longest baseUne of the interferometer, 1.1m for DASI, and A is the wavelength. 
^To obtain the spectral indices in terms of antenna temperature = Iv / {2kv'^), or 5Ta = x^e^/(e^ — 1)^5T, one subtracts 
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Leitch et al. |1997| , de Oliveira-Costa et al. |1997| , Kogut |1997| ) that free-free emission may be correlated with 
dust emission near the NGP, and that this "hot" (10^ — lO^K) component may not emit Ha (for an alternative 
explanation of the correlation in terms of spinning dust grains, see Draine & Lazarian 199^ ). A correlation 
between free-free emission and dust would imply Ct ~ f-^ '^ to ^"^ (Schlegel et al. 09981 Wright |l998|). 



3.3. Extragalactic Foregrounds 



The fluctuations from extragalactic sources have been modeled by Toffolatti et al. ( 1994 , 1997 ) and 
Franceschini et al. (1991). The source models are robust below 100 GHz though uncertain to almost an 



order of magnitude well above 100 GHz. 

The angular dependence of uncorrelated point sources is of course that of white noise: Cg ^ . There is 
some evidence that radio sources are correlated (Peacock & Nicholson 1991 ) , but the non-Poisson contribution 



to the anisotropy is always smaller than the Poisson contribution (Toffolatti et al. |1994 |1997[) . Providing 
the sources exist over a large range of distances from us, any correlation in the sources at small scales is 
significantly diluted by projection. 

We show in Fig. ^ the angular power spectrum, l(i -f \)Ci ^ associated with point sources at 
30 GHz, assuming that we subtract all sources brighter than SOmJy. We have taken the luminosity function 



of VLA FIRST (Becker et al. |1996D radio sources at 1.5 GHz from Tegmark & Efstathiou ( [1996^ Eq. 43) 
and extrapolated all the sources assuming ~ v~°' . For a > it is below the expected cosmological signal 
for the range of scales probed by DASI. Point source subtraction for DASI will be facilitated by using the 
ATCA to map the DASI observing region at 16-26 & 43 GHz (the DASI observing region also overlaps with 
regions observed by Python and planned observations by Boomerang and Beast). 



3.4. Foreground Subtraction 

We will for definiteness here consider the DASI instrument, though our general conclusions will hold 
for the other planned instruments with similar frequency coverage. In the absence of external information 
about foreground emission we can use the 10 GHz bandwidth of DASI to marginalize over the unknown 
amplitude of a foreground component. Since we are working at low frequency the dominant foregrounds are 
synchrotron and free-free emission, which have similar spectral indices. Thus one is led to consider fitting out 



a single component. Using the formalism of Dodclson (1997) and 5 frequency channels centered at 27, 29, 
31, 33 and 35 GHz we find that the error bars on the CMB component are increased by a factor of 4.2 (5.1) 
if we project out a synchrotron (free-free) foreground with (temperature) spectral index -2.7 (-2.2). Using 
10 X 1 GHz channels doesn't change this number significantly. With an improved foreground extraction 
method it may be possible to reduce this by roughly a factor of 2 (Tegmark 1997a, White 199§| ). 



The increase in the error bar is due to the small operating band of DASI, and indicates that we 
would like to use external information (foreground maps) when interpreting the DASI observations. As 
an example, assumptions about the spatial properties of the foregrounds (e.g. their ^-space power spectra) 
can improve the separation of foreground and signal. The DASI will originally operate in a region known to 
be low in foregrounds, but the need for a large sky coverage will eventually require operation in regions 
with significant foregrounds. Due to the dependence on the region of sky surveyed, we will focus on 
the instrument characteristics from now on, and not include the increase in the error bars expected from 
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foreground subtraction. 



4. Formalism 
4.1. The Visibility 

An interferometer measures (£'i-Z5'|) where Ei and E2 are the electric field vectors measured by two 
telescopes pointing to the same position on the sky and (• • •) represents an average over a time long compared 
with the period of the wave. Assuming for now a monochromatic source of radiation and working in the 
Fraunhofer limit, the average of the product of electric fields is the intensity times a phase factor. For each 
point source, the phase factor is the exponential of {i times) the geometric path difference between the source 
and the two telescopes, in units of the wavelength. Taking the integral over the source/emitter plane gives 
the Fourier Transform of the observed intensity (i.e. the sky intensity multiplied by the instrument beam). 

The fundamental observable for the interferometer is thus a "visibility" , which is the Fourier Transform 
(FT) of the sky intensity multiplied by the primary beam or aperture function (Tompson, Moran & 
Swenson |1986| ): 

V{u) (X [ dx A(i)Ar(i)e2™-* (2) 



where AT is the temperature (fluctuation) on the sky, i; is a unit 3-vector and u is the conjugate variable, 
with dimensions of inverse angle measured in wavelengths. A{x) is the "primary" beam and is typically 
normalized to unity at peak, which is ^/2na larger than the usual normalization of a gaussian beam. (By 
requiring A{0) = 1 we ensure that the area of the aperture in the u plane is unity.) The spacing of the horns 
and the position of the beam on the sky determine which value of u will be measured by a pair of antennae 
in any one integration. The size of the primary beam determines the amount of sky that is viewed, and 
hence the size of the "map" , while the maximum spacing determines the resolution. 

Typically the field of view of the interferometer is small, so in what follows we will make a small angle 
approximation and treat the sky as flat (see also the Appendix) . This is a very good approximation for the 
upcoming experiments (CBI, DASI, VSA) and leads to significant simplification in the formalism. If the 
primary beam A(x), or more generally the area of sky surveyed, is well localized the integral is only over a 
very small range of x. Denoting the center of the beam by xo we can write x = xq + xj_ with x±_ <^ xq = 1 
and xj^ • Xq = 0, then is a 2D vector lying in the plane of the sky. We will denote 2D vectors by boldface 
type, and vectors in 3D or other spaces by arrows. A vector name which is neither boldface nor arrowed 
indicates the length of that vector, e.g. x± = |x^|. 

Finally we should remark on one subtlety in the statistical analysis of a "random" component such as the 
CMB fluctuations. It is common to assume that the temperature fluctuations in the CMB are realizations of 
a random field, so that T(x) is a real random field on (see below). Since V{u) is the Fourier Transform 
of r(x), it is also a random field, however it is complex. Since T is real, it follows that V*{u) = V{—u) and 
thus V{u) is a complex random field with independent degrees of freedom only over the half-plane (i.e. twice 
as many as T over half the area). This restriction to the half- plane will be important when it comes to 
parameter estimation. An alternate formulation of the problem, which turns out to be equivalent, is to 
define a "real" visibility in terms of cosine (sine) transforms whenever e.g. u^: > {ux < 0). We will not give 
this parallel development here, as it is exactly equivalent to the complex case we will discuss. 
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4.2. The Sky Power Spectrum 

To proceed, notice that in the smaU field of view approximation the visibihty is the convolution of the 
FT of the sky intensity (temperature) and the FT of the primary beam. Thus if we knew the power spectrum 
of the sky, we could find the power spectrum of the visibilities by convolution with the Fourier transform of 
the primary beam (see below). In this section we will concentrate on the sky power spectrum and neglect 
the effect of the primary beam, and so we set A ^ 1 for now. 

We usually assume that our theory has no preferred direction, i.e. it is rotationally invariant. In the flat 
sky approximation this rotational invariance becomes a translational invariance on the plane. This means 
that the (double) FT of the sky correlation function becomes diagonal in u, i.e. "conserves momentum". 
(We discuss the flat sky approximation further in the Appendix.) We shall call the diagonal part the sky 
power spectrum S{u) — S{u), not to be confused with a flux. 

The ability to perform Fourier analysis on the "flat" sky and the replacement of rotational by 
translational invariance is the principle advantage of the flat sky approximation. These advantages are 
only obtained in the small-angle limit, regardless of how one chooses to map angle into m, so the reader 
should beware of "improvements" to the flat sky approximation except under very special circumstances. 

We can write the FT of the correlation function, depending on u and w, as 

J dxi dii.2 C(xi • X2) exp [27rm • (xi — X2)] exp [27ri(w — u) • xi] , (3) 

where C(cos6') is the (dimensionless) correlation function for the CMB temperature fluctuations, deflned 
below. If we expand 

oc 

g2,r™.x ^ jp(27ru) + 2 ^ J„(27ru) cos (marccos(u • x)) (4) 

m— 1 



and use the symmetry of the problem to do the angular integrals we find (e.g. Subrahmanyan et al. 199S ) 
that the diagonal part 

S{u) cx / uduj C{uj)Jo{2nuLj) (5) 
Jo 

where uj = |xi — X2I = 2sin(0/2) and d{cos9) = ujduj. In the fiat space limit we extend the upper limit of the 
Lu integration to 00. Expanding the correlation (or 2-point) function for the CMB temperature fluctuations 
as a Legendre series 

/ AT AT \ 1 °° 
C(xi .X2) ^ / — (xi) — (X2)) = -5](2£+l)QF,(xi .X2) (6) 



and using Gradshteyn & Ryzhik (198C; Eq. 7.251(3)) we obtain 



S{u) = V(2£ + 1)Q J2£+i(47ru). (7) 
2t:u ^-^ 

t 

In evaluating this sum, care must be taken for regions of the spectrum where is nearly scale invariant due 
to significant cancellations. 

For large £, J2t+i is a sharply peaked function. Thus the 2D power spectrum u'^S{u) ~ £{£ + l)C'£|^=27i-u- 
Direct numerical evaluation of Eq. (^ or requiring the RMS fiuctuation at zero lag to be the same in u space 
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as ^ space (Gradshteyn & Ryzhik 1980| , Eq. 6.511(1)) allows us to write 



(27r)2 



Cf 



for u > 10 



(8) 



?=2irM 



This approximation works at the few percent level for standard CDM (sCDM) when w ;> 10 or ^ ^ 60. 

We show S{u) vs. u for a selection of CDM models normalized to the COBE 4-year data in Fig. ||. The 
range of angular scales that will be probed by DASI and CBI are shown as the solid lines across the top of 
the figure. While these models were chosen primarily to fit the large-scale structure data, all of the models 
shown provide reasonable fits to the current CMB data. The parameters for the models are given in Table ^. 



4.3. The Visibility Correlation Matrix 

In theories which predict gaussian temperature fluctuations the fundamental theoretical construct is the 
correlation matrix of the measured data. Since the data in our case are the visibilities measured at a set 
of points Ui, in what follows we will need to know the correlation matrices for the signal and noise in the 
various visibilities. The measured fluxes, V{u), are 

9B.^ AT(x) 



Vin) = — f TcMB / dx — ^ A(x)e^— (9) 

J 1 CMB 

where dB^/dT converts from temperature to intensity, Tcmb is the CMB temperature and A(x) is the 
primary beam. The conversion factor from "temperature" to "intensity" is 

dB^ 2kB fkBT\^ x^e"" _ f 99.27 3y st-^\ x'^e' ^^^^ 



dT c'^ \ h ) (e^-l)2 V y(e^-l)2 

where B^ is the Planck function, ks is Boltzman's constant, x = hv/kBT^MB — t^/56.84 GHz is the 
"dimensionless frequency" and 1 Jy^lO^^e -y^^/m^/Hz. In the Rayleigh- Jeans limit dB^/dT ~ 2kB{v/cY 
where v is the observing frequency. This is a good approximation for the frequencies of planned 
interferometers, the correction to the Rayleigh-Jeans assumption is 2% at 30 GHz, and we shall make it 
henceforth. We shall also set c = 1 so = A^^. 

The Fourier Transform of the primary beamQ is the auto-correlation of the Fourier Transform of the 
point response, g, of the receiver to an electric field, A{u) = g-kg[u) and 



A(x) = J du A(u)e-^"*"-^ , (11) 
so using the fact that the power spectrum is diagonal in u we have 

CYj = {V*{u,)V{u,)) - {2kBTcyiBi^^f j d^w I*(u, - w)I(u, - w)5(|w|) . (12) 

Notice that the visibilities are uncorrelated if |ui — Uj| is larger than (twice) the width of A, which defines 
the bin size Au. 



* Throughout we will use a tilde to represent the Fourier Transform of a quantity. 
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We are now in a position to define the window function Wij (u) which, when convolved with the power 

V 



spectrum, defines the visibility correlation matrix C^^ . From the above 



/>OC 

CYj = {2kBTcMBi^^y / wdw S{w)W^, 
Jo 



H (13) 



with 



Wy(|w|)= / de^ A*{u,-^v)A{uj-w) . (14) 







Note that since A has compact support the maximum of Wu scales as u^^ for Ui 3> Au. Since the noise per 
visibility is independent of Ui the signal-to- noise drops as for a scale-invariant spectrum, S{u) oc 
Also note that both Wij and S{u) are positive semi-definite, so the visibilities are never anti-correlated, 
unlike single-dish (chopping) experiments. 

In general the distribution of the electric field in the horn aperture is close to a pillbox times a Gaussian. 
Due to the finite aperture A has compact support. Usually the response is independent of u: we will call the 
|u| for which A vanishes Am. In order to obtain a simple estimate of our window function it is a reasonable 
first approximation to take A equal to the auto-correlation of a pillbox of radius D/2 where D is the diameter 
of the dish. Specifically 



I(u) = ^ 

TT 



arccos — 



(15) 



if u < D and zero otherwise. Thus in this simple example Au = D. If we require v4(0) = 1 then this must 
integrate to unit area, so A~^ = 7r(i)/2)2, or the area of the dish. 

In the case where all correlated signal is celestial, the correlation function of the noise is diagonal with 

V TIaAb ) Ai^taUb 



= ( ) 1^ h ■ (16) 



Here T^ys is the system noise temperature, A the wavelength, 77^ is the aperture efficiency, Ad is the physical 
area of a dish (not to be confused with A(x)), Ub is the number of baseline^ corresponding to a given 
separation of antennae, A^ is the bandwidth and ta is the observing time. Typical values for DASI are 
Tsys = 20K, -qA ~ 0.8, dishes of diameter 20 cm, Uh — i and A^ — 10 GHz (in 10 x 1 GHz channels). For 
CBI the dishes are 5 times larger with the other numbers about the same. We show in Fig. ^ the diagonal 
entries of and for one pointing and 1 day of observing with DASI. 

Using Eqs. (|l^, [l^) we can provide a rough estimate of the signal-to-noise expected in a given visibility. 
For A given by Eq. ( |l5[ ) we have 

which given the numbers above yields Cfi /C^ ~ lO'^ {Au/uf for 1 day of integration, assuming a COBE 
normalized, scale-invariant spectrum. 



^The number of baselines formed by n,- receivers is n;, = nr{nr — l)/2. 
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5. Comparison with Theory 

5.1. Likelihood function 

In stochastic theories AT(i) is a (gaussian) random variable with zero mean and dispersion given by 
the Cf. Thus the visibilities measured by the interferometer will be gaussian random variables with zero 
mean and dispersion + . For a given set of measured visibilities one can test any theory, or set of 
{Cf}, by constructing the likelihood function (for complex variables V) 

^ ({^^}) = -^^^h^ '^^p [~y*i^^Wy{^o)\ (18) 

where Cij = Cfj + C^^ is the correlation matrix of visibilities at and Uj (Hobson, Lasenby & Jones 
1995| ). Note that the visibilities are complex and thus the likelihood function is slightly different than for 



the case of real gaussian random variables, e.g. the "missing" factor of i in the exponent. The restriction 
to the half-plane however ensures that the number of degrees of freedom is the same as for the real case 



(c.f. Hobson, Lasenby & Jones 1995) 



Name f^mat h Vl^h? n 

sCDM i (15 0.0125 T 

OCDM 0.5 0.6 0.0200 1 

tCDM 1 0.5 0.0250 0.8 



Table 2: The cosmological parameters for the theories discussed in the text. All models have JIa — and 
have been normalized to the COBE DMR 4-year data. 

Given a set of data {Vi} one can proceed to test theories using the likelihood function. Confidence 
intervals for parameters and relative likelihood of theories are calculated in the usual way. 



5.2. Power Spectrum Estimation 



There are several ways one could consider estimating the angular power spectrum from a set of visibilities. 
Conceptually the simplest is to average |F(u)p in shells of constant This gives a (noised biased) estimate 
Ci at £ = 2t:u, convolved with the window function Wij. 

A more sophisticated method is to define as a "theory" a set of bandpowers, i.e. define the power 
spectrum £{i+ l)Ci as a piecewise constant, in A^band bands B. One then maximizes the likelihood function 
for this "theory", the result is the best fitting power spectrum, binned into groups at similar £. Such an 
approach has been used on the COBE 4-year data by Gorski ( 1996| ) and Bunn & White ( 1996 ), and discussed 



extensively by Bond, Jaffe & Knox (1998). It is the method we shall advocate here. 



As pointed out by Bond, Jaffe & Knox (1998), it is particularly simple to find the maximum of the 
likelihood function for such a "theory" using a quadratic estimator (see also Tegmark I997b| ) which takes 
as an input a trial theory. In this case the theory is a set of "bandpowers" chosen to cover the £ range of 
interest and be approximately independent. Iteration of the quadratic estimator is equivalent to Newton's 
method for finding the root of dC/dp where p is a parameter on which C depends. 
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Specifically from an estimate of the bandpowers, u'^S{u) — Pa for u £ (where a = 1, • • • , A^band) an 
improved estimate is Pa — Pa + Spa with 



Spa = (tr 



c^^Ca c-^c. 



tr 



(19) 



(Bond, JafFe & Knox 1998| ). Here C indicates the (theoretical) correlation matrix evaluated at the initial 
estimate Pa with C,q = dC/dpa and C = V*{ui)V{uj) indicates the matrix formed by the data. Iteration 
of this procedure (e.g. from an initially flat spectrum pa ^constant) converges to the maximum likelihood 
estimate of the power spectrum. 

Since the parameters Pa are chosen to be the (constant) values of u^S{u) across the band and the 
noise is assumed to be independent of the level of cosmological signal 



— W,,{w) 

Ba, ^ 



(20) 



We may gain some intuition for this expression by considering the simple case of 1 band and uncorrelated 
visibilities. Assume additionally that Wij(u) is independent of u. Then if we write C = afwiSij and 



C a — WiSij we have 



(21) 



This becomes even simpler if all of the visibilities are at fixed ju^j. Then aY is independent of i and, if we 
assume the noise is also, Sp becomes zero when 



1 ^ 



(22) 



which is reminiscent of our simplistic estimate described at the beginning of this section. 



5.3. CDM Parameter Estimation 



It has become common (Scott & White 1995 , Jungman et al. 1996| , Bond, Efstathiou & Tegmark 1997] , 
Zaldarriaga et al. 1997, Stompor & Efstathiou 1998) to ask how well we could measure theory parameters 
"on average" given a set of measurements {Vi} which are "typical". Imagine that our theory is defined by 
a set of parameters {pa} and that the sky corresponds to this theory with values of parameters Pa- In this 
case 



ln£) = tr 



cc- 



InC 



(23) 



where Cij = {V*Vj) denotes Cij{p). The precision to which we can measure {pa} assuming an input theory 
C is given by the second derivative matrix of (— ln>C) 



a2(-in/:) 



dpadpp 



= tr 



with = dC/dpa- Thus 



{{Pa - {Pa)){pp - {pp)))c = {tr [c-ia„ a-^a^] } ' 



(24) 



(25) 
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where (• • •)£ denotes an average with respect to the likehhood function H. 

As an example of how well planned interferometers will constrain cosmological models we can ask how 
well DASI would be able to determine the cosmological parameters ya— {C'lO: f^mati f^B^^i for the 
adiabatic CDM models listed in Table |^. We have not included the ionization history in this list since the 
amplitude and the ionization history are degenerate on these scales for late reionization. We will also assume 
that r^A — 0. We choose the value of at ^ = 10 for our normalization parameter as this is approximately 
the "pivot point" of the COBE data. 

In order to avoid a detailed modeling of the DASI observing strategy before the instrument is operational, 
we shall simply assume that DASI will fully sample the u — v half-plane between m = 25 and 110. The number 
of different u needed to cover the u — v ring at u if the aperture has a radial width 2Au is N(u) — 2u/ lS.u^ 
where we have made use of the fact that only the /la/f-plane is covered. (This is actually conservative - 
points do not become correlated as soon as the A's touch. We would want to sample a factor of ~ 2 more 
densely than this and keep track of correlations, but we shall postpone an optimization to a later paper). 
The DASI has continuous sensitivity in |u|. Thus we assume that we can fully cover the u — v plane with 
A^(i*max) pointings (orientations) of the instrument. We spend equal time on each orientation. 

To simplify this computation we will count only the independent visibility measurements arising from 
such a strategy, i.e. ignore the extra information that will be available from the correlations. Specifically 
we choose such that there are A'bin independent bins in the radial (|u|) direction and Af(|ui|) bins in the 
angular {u) direction. Eq. ( p5| ) in the case that is diagonal reduces simply to 



where we have assumed the noise is independent of Pa ■ The width of the aperture Au is chosen so that 
= 27VbinAu. To obtain this resolution in u requires increasing the sky coverage either by 
shrinking the dish size or by mosaicing (see We shall assume that if mosaicing is used then the spherical 
symmetry of A is preserved. As discussed in §|^, this will only be an approximation. A full analysis of the 
strategy including mosaicing, which takes into account the deviations from the monochromatic and "flat 
sky" approximations and includes the correlations between visibilities remains to be done. The number of 
bins is used here as a measure of the sky coverage achieved by the experiment. For reference A^bin = 20 
corresponds to a 10'^ square degree circle of sky. 

We shown in Table |^ the relative uncertainties on pa , with a "prior" 20% uncertainty in Cio . (We do 
not include the increase in the errors from foreground subtraction here.) The number of bins is fixed at 
A'^bin — 20, or lO'^ square degrees of sky coverage, for simplicity. For 3 months of observing an sCDM sky 
the DASI would be system noise limited for u ^ 50. Note that there is a strong dependence of the estimated 
errors on the input theory (and the parameter set chosen). We have chosen the 3 theories here to explore 
this dependence. For the open model (which has flo = 0.5) the peaks in the power spectrum are at higher 
u than the critical density models (see Fig. ||). This means that there is less information (from the higher 
peaks) available to break the degeneracy in parameter variations, leading to larger uncertainties when the 
other parameters are integrated out. To increase the precision with which cosmological parameters could 
be determined in an open model, one would need to extend the coverage to higher |u| using CBI. If this is 
done the angular scale of the features in the open model is better matched to DASI+CBI than in the flat 
models, and the parameters can thus be better determined. For the tilted model the signal-to-noise is lower 
at large u, which accounts for the slightly larger uncertainties on e.g. the spectral slope n. However the 




(26) 
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higher baryon fraction leads to greater sensitivity to CIb, and the variations with Qmat are less correlated 
with f^B- 





3 months 


6 months 


12 months 


Name 


^mat 
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^mat 
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Ob 


n 


Omat 
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sCDM 


21 


40 


50 


9 


15 


26 


34 


7 


12 


19 


27 


5 


OCDM 








62 


73 






45 


53 




91 


33 


tCDM 


30 


71 


40 


16 


17 


39 


25 


10 


11 


23 


17 


7 



Table 3: The percentage relative uncertainties for the cosmological parameters assuming 3, 6 or 12 months 
of integration of the DASI over 10^ square degrees (A'bin = 20), for the theories discussed in the text. We 
have included a "prior" corresponding to 20% uncertainty in Cio- For the open models several parameters 
are very correlated because DASI alone does not probe to high enough u. For sCDM with A^bin = 20 and 
Tgys = 20K we are noise limited for u ^ 50 with 3 months observing time. Once DASI and CBI are combined 
the uncertainties on all parameters for all models are < 20% for 1 year of data. 

For all of these theories the uncertainties are larger due to the large correlations between parameters: 
if the bin size or noise is too large then one cannot distinguish between variations in different parameters, 
inflating the marginalized errors. To decrease the correlations one must increase the resolution in u, which 
means obtaining more sky coverage, or increase the total range of u covered, which means combining DASI 
and CBI. The combination of DASI and CBI is particularly powerful, with the marginalized errors for each 
model being below 20% for all parameters within 1 year of observing. 



6. Mosaicing 

6.1. Increasing Resolution 

The resolution which wo have in u-space is limited by the amount of sky that we have surveyed, which 
for a single pointing is equal to the size of the primary beam. By combining several contiguous pointings of 
the telescope we can increase the amount of surveyed sky and therefore increase the resolution in u-space. 
This is known as mosaicing. The idea here is to measure the visibility as a function of position y from the 
map or phase center 

Fy(u) oc J dPx A(x - y)T(x)e2'^™-'' (27) 

We then sample at a series of points by repointing the entire telescope. Let us denote this by a sampling 
function n(y) which will be a sum of delta functions. We compute the Fourier Transform of n(y)V^ which 
is simply the convolution {U.-kV}{\) where 

y(u,v) =I(v)f(u + v) . (28) 

Here u is the original vector in the u — v plane, and v is the variable conjugate to y. 

If our sampling function is a sum of delta functions then 11 is a sum of plane waves. Thus simply 
summing our Vy at points yj with weight Cj changes our aperture in u-space from A to 

^c.e-^-^--''^ I(v) (29) 

3 
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which can be made much narrower. This is completely analogous to the usual case of Fraunhofer diffraction 
through many holes, treated in most textbooks on optics. The simplest example is when Cj = 1 and the 
lie on a regular N x N grid with spacing 6 then 



e = exp 



sin [N2ttVx:Sx] sin [N2TrVySy] ^^^^ 
sin [27r?;3;(5x] sin [2TrVySy] 



The second diffraction spike occurs at Vx,y = S^y- If we choose 6x,y sufficiently small (Nyquist sampling) 
then this will be outside the range where A vanishes^. Fig. || shows the gain for a 3 x 3 mosaic. 

In the absence of a preferred direction in the theory, the power spectrum of the sky is symmetric in u, so 
choosing a mosaicing strategy based on a cartesian grid is not optimal. We would like to have the mosaicing 
maximally rotationally symmetric within the observing constraints. In Fig. |^ we compare the 3x3 square 
to 7 points laid out at the center and vertices of a regular hexagon, with all points equidistant from their 
neighbors. 

Note that mosaicing does not increase the range of u to which we are sensitive (see Eq. p8| ), it simply 
enhances our resolution by allowing us to follow more periods of a given wave. Thus we retain our ability to 
reject long-period noise or foregrounds. 

If the goal were simply a measurement of the power spectrum, it is just as efficient to use smaller 
telescopes (with intrinsically better u resolution) and integrate for longer as it is to use large dishes and 
mosaic to increase the resolution. The advantage of the later method is that it allows better imaging of 
each piece of sky for checks of systematics, non-gaussian features and foregrounds and it is easier to avoid 
"dropouts" in the u coverage for the power spectrum. 

There are two routes to analyzing mosaiced data. The first is to treat the visibilities Vy^ (u^) as separate 
data, highly correlated in a calculable way, with apparently low resolution but much information in the 
correlations between visibilities. The theory correlation matrix of mosaiced data can be written in the form 
of Eq. (^) with a modified window function which allows different pointing centers for each visibility 

/■27r ^ ^ 

Wij(|w|)=/ d6l^ A*(ui - w)^(uj - w)e2'^*[("^-^'-yj-("'-^)-y'l . (31) 
Jo 



We discuss this further in S3.2 below 



The other route is to statistically weight the V{ui) from the different to form a synthesized data set 
with fewer visibilities and correlations and intrinsically higher resolution. This method is perhaps simpler 
to understand but the weighting of the different yj will probably not be optimal for parameter estimation 
or power spectrum estimation. 



6.2. Optimal Subspace Filtering 

A set of data from several (mosaiced) pointings involves many measured visibilities, very few of which 
are independent. Thus likelihood analysis requires repeated inversion of a large matrix, which can be 
computationally quite slow. One method for increasing the efhciency of the calculation is to work with a 



^If we sample at precisely the spatial Nyquist rate, gain variations from stare to stare will also alias power outside of the 
spatial frequency window. 
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subset of the data which contains most of the signal, but has fewer elements. This transformation can be 
accomplished using an "optimal subspace filter" . 

Consider first the case of a single (unmosaiced) pointing. For the DASI the visibilities will be measured 
at a specified set of u at any one time, and then the instrument will be rotated about an axis through the 
center of the aperture plane to obtain a different set of u with the same lengths but different orientations. 
In this way most of the u — v plane will be covered allowing for imaging. The number of rotations will be 
chosen to almost fully sample the u ~ v plane at the highest resolution (largest baseline or largest u). This 
means that considerable oversampling of the smaller baselines must result, since the aperture A has a fixed 
width Au. Phrased another way, the correlation matrix of our visibilities has dimension ^ (limax/Aw)^. 
However only a fraction of the entries are uncorrelated and contain "new" information about the theory. 
The rest are primarily a measure of the noise in the experiment. Thus if we could perform a change of basis 
to those combinations which measure primarily signal and those which measure primarily noise we could 
work in a much smaller subspace of the data (the "signal" subspace) with little loss of information about the 
theory. In calculating the likelihood function we need to invert Cy . Since matrix inversion is an process, 
reducing the amount of data can dramatically decrease the processing time. 

As a simple example of this technique, imagine that we measure a visibility twice, with independent 
noise in each measurement. The signal in these two measurements is totally correlated. The sum of the two 
measurements is primarily sensitive to the signal (with noise 1 / \/2 of each individual measurement) while 
the difference measures primarily noise. Thus if we "change basis" to the sum and difference of the two 
measurements we could work with the sum only with little loss of information. This would halve the size of 
the matrix we would need to invert in the likelihood function and speed processing by a factor 8. 

Optimal subspace filtering (also known as the signal-to-noise eigenmode analysis or Karhunen-Loeve 
transform) is a method designed to estimate the best change of basis and subspace in which to work. 



Recent applications of this method to the CMB and large-scale structure can be found in (Bond 1995, Bunn 
& Sugiyama |1994 Bunn, Scott & White |1995| , White & Bunn ]l995b| , Bunn & White 



1996, Vogeley & 



Szalay |1996| , Bond & Jaffe |1996| , Tegmark, Taylor & Heavens |1997| and Tegmark, et al. |1998D . In the above 
example Cij = 1 for all i,j. This matrix has eigenvalues 2 and and (orthonormal) eigenvectors (1, l)/\/2 
and (1, — 1)/\/2 respectively. The eigenvalues measure the signal-to-noise carried by the combination of data 
points d ■ ^a, where ipa are the (orthonormal) eigenvectors. 

In general it can be shown that to find the optimal subspace on which to project one finds the eigenvalues 
and eigenvectors of the matrix cr~^Cija~^ where ai is the noise|^in measurement i. Since Cij/aiaj is positive 
definite and symmetric finding the eigenvalues and eigenvectors is straightforward (e.g. by Jacobi transforms). 
Arrange the eigenvectors in decreasing order of eigenvalue. The first M eigenvectors in the sequence define 
the best M-dimensional subspace to use in filtering the noise from the data. Once increasing the dimension 
of the subspace adds eigenvectors whose eigenvalues are <C 1 the gain for the extra computing burden is 
marginal. 

The advantage of optimal subspace filtering when fitting mosaiced data is obvious. One wishes to cover 
the u — V plane fully with the small effective apertures A obtained after summing the individual pointings. 
Thus each pointing oversamples the u — v plane considerably in terms of its larger "unmosaiced" A, and these 
visibilities are highly correlated. The optimal subspace filter identifies which combinations of the visibilities 



'^If the noise is not diagonal then the appropriate matrix is A'^ ^/'^CN ^/-^ where N^f^ is any square root of the noise 
correlation matrix, for example that defined by Cholesky decomposition. 
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measure primarily the cosmological signal and which are mostly noise, allowing an optimal weighting to be 
given to the individual visibilities sets while retaining the processing time advantage of a smaller data set. 



7. Imaging the Microwave Sky 



In addition to power spectrum estimation and parameter constraints, one goal of DASI and CBI is to 
image the microwave sky. Since interferometers don't measure the temperature of the CMB sky directly, it is 
necessary to "invert" the measured visibilities to form a sky image. In general this inversion is not unique (the 
DC level for example remains unconstrained) and so must be regularized. On a field-by-field basis images of 



the sky can be made using the usual methods of synthesis imaging in radio astronomy (e.g. Cornwell 198£) 
and will form a useful data checking tool. 

It is possible however to try to make a larger scale image of the microwave sky using other (statistical) 
techniques. On degree angular scales, such as probed by DASI, regularized images have been made from the 



ACME/MAX (White & Bunn|l995aj), Saskatoon (Tegmark et al. |1997| ) and CAT (Jones |1996D experiments 
by a variety of techniques. 

Perhaps the most straightforward procedure for going from the visibility data to a sky image is Wiener 
filtering. Wiener filtering has been used extensively on CMB and large-scale structure data in the past 



(e.g. Lahav et al. 1994, Bunn et al. 1994, Zaroubi et al. 1995, Bunn, Hoffman & Silk 1996) and the reader is 
referred to those papers for a description of the underlying theory. The implementation of the Wiener filter 



in the basis defined in §3.2 is treated in many of the references given in that section. 



The Wiener filter provides an estimate of the sky temperature at a point as a linear combination of the 
visibility data. If we assume that foregrounds and point sources have been removed from the visibility data 
and that the underlying sky is gaussian, then Wiener filtering is the "optimal" imaging method. Because it 
is linear, under the assumption of a gaussian sky, the error matrix in the map is also easy to calculate. 



To proceed, let us imagine pixelizing the sky into very many small pixels at position x^, a — 1 • • • A^, 



pix- 



This is purely a bookkeeping device which allows us to use a matrix notation for the Wiener filter, the 
continuum limit is obtained trivially with a sum over pixels replaced by an integral. We denote the underlying 
temperature fluctuation on the sky by ta which is related to the jth visibility by Vj — WjoXa -I- Uj where the 
weight matrix Wja is (see Eq. 0) 



" Aj{yia) 



(32) 



and rij is the noise in the j th visibility. We have labelled Aj (x) with a subscript j to allow for the possibility 
of mosaicing. 

On an algorithmic note, it is computationally simpler to split the visibility into its real and imaginary 
parts and modify Wja to have cosine and sine components so that one deals only with real variables. The 
number of data points is then doubled, and the noise covariance matrix must be modified by a factor of 2 
also. We shall use a complex notation, with the understanding that the implementation of the algorithm 
may be in terms of real valued data. 

If we assume that the correlation matrices of the theory and noise are known (or computed from the 
data, see Seljak 1997b ) then the Wiener filtered estimate of the is 

= C%W,^ iW,-,C';,W^, -t- Cl) ' Vk (33) 
where and are the signal (Eq. ||) and noise (Eq. 16) correlation matrices. The combination W -C^ -W^ 
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is essentially of Eq. (|l2D, we have written it in this way to allow the possibility of including other data 
sets as discussed below. Notice that the form of the Wiener filter is "signal/ (signal+noise)" as is usually the 
case. 

An expression similar to Eq. ( p3[ ) can be obtained for the (formal) error correlation matrix of the 
estimates (see the above references) . The most serious problem is that the Wiener filtered image constructed 
in this manner is missing large scale power. This can be included by fitting to another data set, which retains 
the long wavelength modes, at the same time as the visibility data. In this case one extends the data vector 
Vj to include the extra temperature data, with the associated Wja- In the case of a mapping experiment, 
Wja is simply the beam. For most current degree scale experiments it is a beam modulated on the sky by 
a chopping pattern. We will defer a detailed analysis of how well we can reconstruct the sky under various 
observing strategies, and the noise properties of the images, to a future publication where we also plan to 
discuss the effect of sky curvature. 



8. Polarization 



Neither the DASI or CBI will be polarization sensitive initially, however some instruments operating at 
smaller angular scales (e.g. the VLA) are already sensitive to linear polarization. The (linear) polarization 
of the CMB anisotropics is an important theoretical prediction, and can encode a great deal of information 



about the model (for a recent review see Hu & White 1997a) so we discuss it briefly here. Our analysis is 



very similar to the small-angle formalism developed in Seljak (1997a), though we caution the reader that we 



have a different sign convention for the E- and B-modes, see e.g. HSWZ 



We consider here only the small scale limit, so we treat the sky as a plane with a righthanded coordinate 
system upon it so that the sky is the x — y plane. We define polarization in the horizontal (x) and vertical 
(y) directions to be Q > and Q < respectively. Polarization in the x + y and x — y directions is defined 
to be f7 > and U < respectively. In terms of light traveling to us along z the intensity tensor , with 
hj = x,y, is 



(34) 



where the angled brackets indicate an average over a time long compared with the frequency of the wave, Ei 
is the electric field component and crfc are the Pauli matrices. We have neglected Va2 since this corresponds 
to circular polarization, which is not generated cosmologically. Under a rotation by an angle '0 around z the 
temperature is clearly left invariant while Q ±iU transforms as a spin-2 tensor 



(Q ± lU) e=F2''> (Q ± lU) 



(35) 



We expand Q ±iU in basis states known as spin-spherical harmonics ±2Yirn with coefficients af^ in analogy 
with the temperature fluctuations (Seljak & Zaldarriaga 1997; Kamionkowski, Kosowsky & Stebbins 1997j ; 
Hu & White 1997b). We can form states of definite parity, ± aj^, which are called E- and i?-mode 
polarization respectively (not to be confused with the E and B modes of the radiation) and which have 
angular power spectra Cf and Cf like the temperature. In addition the _E-mode of the polarization is 
correlated with the temperature, so there is a fourth power spectrum: Cj^ . 

The spin-2 spherical harmonics are better known in the context of quantum mechanics as Wigner 
functions: 

sYtm cc I?l,.,„ = (£, -s\ n \£, m) (36) 
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where TZ = exp[zd • J] is the rotation operator and J are the angular momentum operators. The case s = 
are the well known spherical harmonics. Thus we can relate the ±2Yim to the usual Y^m through raising 



and lowering operators (Zaldarriaga & Seljak 1997). The raising and lowering operators are differential 
operators since differentiating TZ with respect to its arguments inserts J± oc i iJy inside the bracket in 
Eq. (|3^). In the flat space case the operators are trivially dx ± idy, with an overall normalization from 
the normalization of the angular momentum states. 



Going to the flat space limit (see Appendix) and acting on the Fourier integral with {dx ± idy) 
that the Fourier coefficients and are related to the measured Q and U through 



-Q(x) = J <Pu {Eu cos 29u - Bu sin 29u) e 
-[/(x) = J(fu{EuSm29u + BuCOs29u)e 



27rm-x 



WG find 



(37) 



where the signs reflect the definition of E and B in (HSWZ), and come from differentiating an imaginary 



exponential twice. [We caution the reader that the power spectra output by the program CMBFAST differ 
in the sign of Cf^ from this convention (HSWZ). Obviously Cf^ and Cf^ are invariant under the sign 



change.] If we recall that changing the sign of the polarization rotates the polarization "vector" through 
90° we see that the i?-mode has polarization always tangential in the u — v plane (Seljak 1997a ; it would be 
radial under a sign change). 

We now have 4 non-vanishing power spectra, all diagonal in u, whose diagonal parts we can denote 
by S'^'^ , S^^, S^^ and S'^^ in analogy with Eq. (|^). This is all the formalism we need to allow us 
to reconstruct the power spectrum of temperature and polarization, or provide constraints on theoretical 
models. We proceed as in Hu & White (1997a; see also Zaldarriaga 1997 ) by defining a vector of our data 
Di = (Ti, Qi,Ui; ■ ■ ■ ; T^r, Qn,Un) at each point u^. The likelihood function for this data is analogous to 
Eq. ( p^ ) with a correlation matrix 



Cf/ (X j Mn)A,{n)Y,^'"'{0u)uj''P{9u) 5"^(u) 

a,/3 



(38) 



where X,Y = {T,Q,U) and a,/3 = T,E,B with lo^^ = 1. lo^^ = -cos26l, lu^^ = sin26l, to^^ = -sin26', 
to"- 



'^^ = - cos 261. 



Given a set of measurements one can reconstruct the power spectrum of the temperature or the 
polarization as described in §5.2. Alternatively one can provide upper limits (or measurements) of a 
bandpower for polarization or temperature by calculating C{Q'^) where is the amplitude of a flat power 
spectrum (i.e. (.{(. + l)Ct ^constant) for the TT, TE, EE and BB power spectra. [If information on the 
temperature is absent one merely drops the X — T entries of the correlation matrix above.] If one believes 
that there is no foreground contamination, the cosmological signal on small scales should be dominated by 
i?-mode polarization and so one can set = in evaluating the likelihood. 



9. Discussion 

The theoretical study of CMB anisotropies has advanced considerably in recent years, both in terms of 
theoretical predictions and comparison with observational data. In this paper we have developed some of the 
formalism necessary for the analysis of interferometer data, such as will be returned by VSA, DASI, CBI and 
VLA, in the modern language of anisotropy (temperature and polarization) power spectra. Although the 
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fundamental entities measured by an interferometer differ considerably from those measured in single-dish 
experiments, much of the analysis can be presented in an analogous manner including power spectra, window 
functions and the like. This allows one to use directly the sophisticated analysis techniques which already 
exist for single-dish experiments. 

The formalism presented here is based upon the "flat sky" approximation and is thus best suited to the 
study of small-scale anisotropics. In this regime interferometers can provide high-sensitivity measurements 
of the high-^ peaks in the angular power spectrum, effects of gravitational lensing, the damping tail of the 
anisotropics and proto-galaxy formation and second order anisotropics. For CBI the cosmological signal 
would be higher if the universe had significant spatial curvature since this would shift the features in the 
angular power spectra to smaller angular scales, however for both VSA and DASI significant signal is expected 
simply based on existing measurements. 

Several outstanding problems remain, all associated with extending the framework of this paper to 
larger angular scales where the curvature of the sky and the full 3D nature of the Fourier Transforms become 
important. We intend to return to these issues in a future paper. 



We would like to acknowledge useful conversations with Wayne Hu, Douglas Scott and Max Tegmark. 
We thank Lyman Page for many comments which helped to clarify the paper. M.W. thanks Pedro Ferreira 
for conversations during the initial phases of this work. 



A. Flat Sky Approximation 



The derivation of the correlation function Eq. (0) given in the text is sufficient for the purposes of this 
paper. However, sometimes it is convenient to treat the sky as flat and replace spherical harmonic sums 
with Fourier Transforms at the temperature (rather than the 2-point function) level. We give some details 
of this in this Appendix, the reader is also referred to Zaldarriaga & Seljak (1997). 

In the flat sky approximation we replace the spherical polar coordinates {9, (p) by radial coordinates on 
a plane: r = 2 sin 61/2 « 9 and We can exchange our indices {(,m) for a 2D vector 1 with length £ and 
azimuthal angle ipi and define 



a(l)^ 



47r 



2e- 



(Al) 



with a(— 1) = a*(l). We now expand our temperature field in terms of multipole moments as usual with 



im 



1 

2^ 



didifii a(l) 



2£^ 



(A2) 



where we have replaced the sum over £ with an integal. 

Writing the sum over m in terms of m < and replacing the associated Legendre polynomials with 
Bessel functions using Gradshteyn & Ryzhik ( 1980 ; Eq. 8722(2)) we have 

1 



im 



£d£ dipt a{\) 



Mm- 



m— 1 



i"^Jmi£9) COS rn((/) - ipe) 



(A3) 
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The term in square brackets is simply the Rayliegh expansion of a plane wave so we finally obtain, writing 
1 = 27ru, 

— (x) = ^ a£„jY«™ « / (fu a(u) exp [27rm • x] (A4) 

and of course the two point function for a(u) is diagonal: 

(a* (u)a(w)) = (u _ . (A5) 

These expressions can then be used to derive Eq. (^), though the derivation presented above Eq. (|^) 



uses more controlled approximations. The generalization of Eq. (A4) to polarization is presented in 
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Fig. 1. — (a) The physical configuration of the DASI receivers. The axes are graduated in wavelengths, 
physical separations are obtained by multiplying by A = 1 cm. Each dish has an effective optical diameter 
D = 20A (solid lines). The physical size of the dishes is 25cm (dotted lines) and the base plate for DASI 
is 1.6ni in diameter (dashed line). Correlations are measured between all pairs of horns, so the instrument 
samples 78 baselines. Note the 3-fold symmetry of the array, which means that 26 different it's are sampled, 
(b) The sensitivity of DASI vs ^ = 2ttu for the configuration in (a). The window functions are shown for 
each of the 26 different u's sampled, and the bold line traces the outline of these 26 window functions. 




Fig. 2. — The angular power spectrum of the standard CDM model (solid) in dimensionless units. The 
horizontal line represents the range of angular scales probed by DASI. The dashed and dotted lines are the 
angular power spectrum of fluctuations from unresolved point sources. These have been modeled by taking 
the luminosity function of VLA FIRST (Becker et al. 1996 ) radio sources at 1.5 GHz as provided by Tegmark 
& Efstathiou (1996; Eq. 43) and extrapolating all sources to 30 GHz using ^ j/^" with a = 0, 0.3, 0.7. 
All sources brighter than 30mJy at 30 GHz have been removed. 
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Fig. 3. — The 2D power spectrum per logarithmic interval in u, for a selection of COBE normalized models 
chosen to provide good fits to the large-scale structure data (see text). The solid lines across the top of 

the plot illustrate the range of scales to which DASI and CBI will be sensitive. The left panel shows the 
expected uncertainties per bin for the sCDM model for 1 month of observing of 25 widely separated points 
on the sky. For display purposes we have placed the points on the sCDM curve. Each of the 26 baselines is 
shown and the signal-to-noise is about 1 in the highest-u bin. Though each baseline has independent receiver 
noise, the window functions for neighboring points overlap considerably so the cosmological signal is very 
correlated between points. The right panel shows the result with 6 months of integration using a mosaicing 
strategy covering ~ 400 square degrees of sky (again with 5/ A'' ~ 1 in the final bin). In this case each of 
these points is completely independent. We have also shown the expected error bars for CBI, assuming a 
similar mosaicing strategy. 
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Fig. 4. — The 26 diagonal elements of the correlation matrix and Cj^ for COBE normalized sCDM and 
a pillbox aperture function. The noise is assuming 1 day of operation of DASI and is per visibility. We have 
shown results for 1 pointing, without the instrument rotation necessary to fill the u — v plane. The signal in 
each of these visibilities will be highly correlated due to strongly overlapping window functions (see text). 




Fig. 5. — The aperture function for (a) 3 x 3 square mosaic with spacing D/2 (b) 7 points laid out 

in 3 rows (at the center and vertices of a regular hexagon) with all neighbors separated by D/2. Contours 
mark 1, 2, 3 and 4fT from the peak with 3a shown thick {la = e~^/^ of peak power). The dashed lines 
indicate the unmosaiced aperture of the primary beam, the solid lines the result after mosaicing. 



